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ABSTRACT 

We present morphological classifications obtained using machine learning for ob- 
jects in SDSS DR6 that have been classified by Galaxy Zoo into three classes, namely 
early types, spirals and point sources/artifacts. An artificial neural network is trained 
on a subset of objects classified by the human eye and we test whether the machine 
learning algorithm can reproduce the human classifications for the rest of the sample. 
We find that the success of the neural network in matching the human classifications 
depends crucially on the set of input parameters chosen for the machine-learning al- 
gorithm. The colours and parameters associated with profile-fitting are reasonable in 
separating the objects into three classes. However, these results are considerably im- 
proved when adding adaptive shape parameters as well as concentration and texture. 
The adaptive moments, concentration and texture parameters alone cannot distinguish 
between early type galaxies and the point sources/artifacts. Using a set of twelve pa- 
rameters, the neural network is able to reproduce the human classifications to better 
than 90% for all three morphological classes. We find that using a training set that is 
incomplete in magnitude does not degrade our results given our particular choice of 
the input parameters to the network. We conclude that it is promising to use machine- 
learning algorithms to perform morphological classification for the next generation of 
wide-field imaging surveys and that the Galaxy Zoo catalogue provides an invaluable 
training set for such purposes. 
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1 INTRODUCTION 

Classification of galaxies has been a long-term goal in as- 
tronomy (e.g. Ivan den Berghl (1 19981 ) and references therein). 
While classification by human eye is still common, there have 
been several attempts to use machine learning techniques. 
For example, lLahav et af] (|l995l . Il99rj ) showed that artifi- 
cial neural networks can successfully reproduce visual clas- 
sifications. In recent years, artificial neural networks have 
gained prominence as a succesful tool for calculating pho- 
tome tric redshifts e.g. |ColIister fc Lahavll2004l ; iFirth et al.l 
120031 ) particularly wit h regard to the n e xt generation 
of ga laxy surveys e.g. |Banerii et al.l 120081 ; lAbdalla et al.l 
120081 ). However, artificial neural networks were first ap- 
plied to as tronomical data sets in order to classify stel 



lar spectra (|von Hippel et al. 

and galaxy morphologies ( Storrie-Lombardi et al.l 



1994 iBailer- Jones et al.ll 



1998) 



1992 



iNaim et all 19951 ; iFolkes et all 19961 ). Astronomical data sets 
have grown considerably in size in the last decade owing 
largely to the advent of mosaic CCDs that can be used on 
large telescopes in order to image large areas of the sky down 
to very faint magnitudes . The Sloan Digital Sky Survey 
(SDSS) (|York et al.ll2000l ) has led to the construction of a 
data set of around 230 million celestial objects. This data set 
has already been used for morphological c lassification usin g 
automated machine learning techniques (|Ball et alj|2004l ). 
Future generations of wide-field imaging surveys such as the 
Dark Energy SurvejQ, PanStarrijf] and LSST0will reach new 
limits in terms of the size of astronomical data sets. Clearly, 
automated classification algorithms will prove invaluable for 
the analysis of such data sets but these algorithms are yet 
to be applied on such scales. 

The Galaxy Zoo project launched in 2007 has led to 
morphological classification of nearly 1 million objects from 
the SDSS D R6 through visual inspection by more than 
100,000 users (|Lintott et al.ll200Sl) . This project has resulted 
in a remarkable data set that can be used for studies of the 
formatio n and subsequent ev olution of galaxies in our Uni- 
verse. In lLintott et alj (|2008l ) . the Galaxy Zoo classifications 
have been compared to those by professional astronomers 
showing that there is remarkable agreement between clas- 
sifications by members of the general public and the pro- 
fessionals. The biases in the Galaxy Zoo classi fications have 
been studied in detail bv lBamford et all |2009h who go on to 
correct for these biases and use the dataset to study the re- 
lationship between galaxy morphology, colour, environment 
and stellar mass. This data set however also presents us with 
the unique opportunity to compare human classifications to 
those from automated machine learning algorithms, on an 
unprecedented scale. If the neural network is shown to be as 
successful as humans in separating astronomical objects into 
different morphological classes, this could save considerable 
time and effort for future surveys while ensuring uniformity 
in the classifications. 
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In this paper we explore the ability of artificial neural 
networks to classify astronomical objects from the SDSS into 
three morphological types - early types, spirals and point 
sources/artifacts. In §[2] we describe the Galaxy Zoo cata- 
logue. In § O the artificial neural network method is pre- 
sented. §2] details the different choices of input parameters 
that are used for classification. We present our results in § 
[5] and draw some conclusions in § [6j 



2 THE GALAXY ZOO CATALOGUE 

Galaxy Zoo is a web-based project that aimed to obtain 
morphological classifications for roughly a million objects 
in the Sloan Digital Sky Survey by harnessing the power 
of the internet and recruiting members of the public to 
perform these classifications by eye. The first part of this 
project is now complete and the morphological classifica- 
ti ons subsequen t ly ob tained have been described in detail 
in lLintott et al.l (|2008l ) where these classifications have also 
been shown to be credible based on comparison with classifi- 
cations by professional astronomers. The classifications have 
also been used in a number of interesting science papers e.g. 
in the identification of a sample of blue early type galaxies 
in the nearby Universe (|Schawinski et aill2009l) and to s tudy 
the spin statistics of spiral galaxies ( Land et alj|2008l ) and 
the power of this data set is proving e normous for studies o f 
both galaxy formation and evolution (|Bamford et aljfeoogl ). 
Our goal is now to assess whether morphological classifica- 
tions such as those from Galaxy Zoo can be reproduced for 
even larger data sets likely to become available with the next 
generation of galaxy surveys through the use of automated 
machine learning algorithms such as artificial neural net- 
works. Before we proceed, we caution the readers that the 
Galaxy Zoo catalogue is not represented by a simple selec- 
tion function as is the case for both volume and flux-limited 
samples. It contains objects from both the Main Galaxy 
Sample (MGS) as well as the Luminous Red Galaxy (LRG) 
sample of the SDSS and as such over represents the number 
of distant red galaxies in the Universe. This is not particu- 
larly important for the aims of this paper as we are simply 
attempting to reproduce the human classifications using ma- 
chine learning. However, when using these classifications for 
scientific analysis, further cuts could be applied in order to 
remove this bias. 

The Galaxy Zoo catalogue that we use in thi s pape r 
is the combined weighted sample of iLintott et al.l (|2008l ) . 
This contains morphological classifications for 893,212 ob- 
jects into four morphological classes - ellipticals, spirals, 
mergers and point sources/artifacts. Note that the only stars 
in the Galaxy Zoo catalogue are bright stars with halos or 
diffraction spikes which are thus interpreted as extended ob- 
jects by the automatic star/galaxy separation criteria and 
the only point sources included are those objects whose spec- 
tra are best fit by galaxy templates. The classification by 
each user on each object is weighted such that users who 
tend to agree with the majority are given a higher weight 
than those who don't. The final morphology of the object 
is then a weighted mean of the classifications of all users 
who analys ed it. Full de t ails o f the weighting scheme are 
provided in lLintott et all (|2008l ). The weighted catalogue of 
objects contains the SDSS object ID and three additional 
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columns with the weighted fraction of vote of the galaxy 
being an elliptical, spiral or point source/artifact - i.e. clas- 
sified as don't know by Galaxy Zoo users - between and 
1. Any star- forming irregular galaxies are also put into this 
don't know class by the Galaxy Zoo users. If the sum of the 
fractional votes in each of these three classes is less than 
1, the remaining fraction of vote is assigned to the merger 
class. Note that this data set is affected by a luminosity, 
size and redshift dependent classification bias as is the case 
for most morp h ologie s derived from flux-limited data sets. 
iBamford et al.l (|2009h have derived corrections to remove 
this classification bias from the data. However, in this paper 
we work with the original catalogue of morphologies as the 
classification biases are not particularly important for the 
aims of this paper. 

We match the Galaxy Zoo catalogue to the SDSS DR7 
PhotoObjAlH catalogue in order to obtain input parame- 
ters for the neural network code. These input parameters 
are described in detail in § [4] Before input into the neural 
network, we apply cuts on our sample and remove objects 
that are not detected in the g, r and i bands and those that 
have spurious values and large errors for some of the other 
parameters used in this study. iDarg et al.l ||2009h have al- 
ready discussed issues to do with merger classification within 
Galaxy Zoo and constructed a sample of ~3000 merging 
pairs from the Galaxy Zoo data. In this paper, we classify 
objects as ellipticals, spirals or point sources/ artifacts us- 
ing ou r machine learning code and note that the lDarg et al.l 
|2009l ) data set may be used in future for the classification of 
mergers although this has not been attempted in this paper. 
We therefore also remove the few well classified mergers with 
a fraction of vote of being a merger greater than 0.8 from 
the sample as we are not attempting to classify the mergers 
in this work. This leads to a sample of ~800,000 objects. 
Further cuts are then applied to define a gold sample where 
the fraction of vote for each object belonging to any one 
of three morphological classes - ellipticals, spirals and point 
sources/artifacts, is always greater than 0.8. This gold sam- 
ple contains ^315,000 objects and i s esse ntially equivalent 
to the clean sample of lLintott et al.l (I2008T ). The neural net- 
work is run on the gold sample as well as the entire sample. 

It is also the case that faint disky objects are more 
likely to be classified as ellipticals unless the spiral arms can 
be clearly seen. The elliptical sample therefore also proba- 
bly contains a reasonable number of lenticular systems and 
we therefore refer to this morphological class as early types 
throughout this paper. We therefore also consider a sample 
of objects with r < 17 that is defined as our bright sam- 
ple and should suffer from a lower level of contamination in 
the early type class. This magnitude limit is the same as 
that impo sed on object s that were used to determine user 
weights in lLintott et al.l (|2008T ). The bright sample contains 
~340,000 objects and has fewer "well classified" early types 
than the gold sample as discussed later. 
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Figure 1. Cartoon schematic of how both the human eye as well 
as machine learning algorithms such as artificial neural networks 
perform morphological classification and determine parameters 
such as those listed in Table [l] and [2] from the galaxy images. 



3 ARTIFICIAL NEURAL NETWORKS 
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19881: iBishod Il995l; lLahav et alj 1 19951 ; iNaim et aL. „ 
Collister fc Lahavl I2004T ) for classification in this paper. It 



has already been shown that, on a de Vaucouleurs type sys- 
tem, T, which spans values from -5 to 10, human expert clas- 
sifiers agree to rms At = 1.8, and that such agreement can 
be obtained by a neural netw ork when trained on the classifi- 
catio ns of one of the experts l|Lahav et alJI 19951 : INaim et al.l 
Il995h . The neural network used in our study is made up 
of several layers, each consisting of a number of nodes. The 
first layer receives the input parameters described in detail 
in §[4] and the last layer outputs the probabilities for the ob- 
ject belonging to the three morphological classes. All nodes 
in the hidden layers in between are interconnected and con- 
nections between nodes i and j have an associated weight, 
Wij. A training set is used to minimise the cost function, E 
(Eq. [TJ with respect to the free parameters Wij : 



E = y^ j (T NN (w i j,p k ) - T eyejfc ) 2 



(1) 



5 Note that the SDSS object IDs correspond for objects in DR6 
and DR7 



where Tnn is the neural network probability of the object 
belonging to a particular morphological type, pk are the in- 
put parameters to the network and T eye ,k are the fractional 
weighted votes in the training set in this case assigned by 
Galaxy Zoo users. 

If the data is noisy or the network is very flexible, a 
validation set may be used in addition to the training set 
to prevent over-fitting. During the initial setup, one has to 
specify the architecture of the neural network - the number 
of hidden layers and nodes in each hidden layer. We choose a 
neural network with two hidden layers with 2N nodes each, 
where N is the number of input parameters. The architecture 
of the network is therefore N:2N:2N:3. Note that increasing 
the number of nodes further either by adding nodes to ex- 
isting hidden layers or adding more hidden layers to the 
network, does not result in any substantial improvement to 
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the classifications. The three nodes in the output layer give 
the probability of the galaxy being an early type, spiral and 
point source/artifact respectively between and 1. Neural 
nets with this type of output are statistical Bayesian esti- 
mators and therefore the sum of all thr ee outputs is r oughl y 
although rarely exactly equal to 1 fe.g. lLahav et all (| 1996h : 
Appendix C and references therein). Note that this differs 
from the Galaxy Zoo fractional votes which always add up 
to exactly 1 over all four morphological classes - early types, 
spirals, point sources/artifacts and mergers. As mentioned 
earlier, the mergers are not classified by the neural network 
in this paper. 



Name 


Description 


derecLg-derecLr 

dered_r-dered_i 

deVAB.i 

expAB_i 

lnLexp_i 

InLdeV.i 

lnLstar_i 


(g-r) colour 
(r-i) colour 
dc Vaucoulcurs fit axis ratio 
Exponential fit axis ratio 
Exponential disk fit log likelihood 
de Vaucouleurs fit log likelihood 
Star log likelihood 



Table 1. First Set of Input Parameters based on colours and 
profile fitting 



4 INPUT PARAMETERS 

When using an automated machine learning algorithm, the 
choice of input parameters may be crucial in determining 
how well the network can perform morphological classifi- 
cations. Ideally, one wishes to choose a set of parameters 
that show marked differences across the three morphologi- 
cal classes. In addition, it may be useful to define a set of 
parameters that is independent of the distance to the object. 
For example, colours may be used instead of magnitudes and 
ratios of radii could replace individual radius estimates. Fig- 
ure [T] illustrates the key role that the input parameters play 
in the morphological classification. The human eye sees an 
image and performs morphological classifications. The same 
image is also used to derive input parameters such as those 
that will be discussed in this section. The neural network 
uses these parameters, as well as a training set based on 
the human classifications to derive its own morphological 
classifications, Tnn- These are then compared to the hu- 
man classifications, T eye in order to assess the success of the 
machine-learning algorithm in reproducing what is seen by 
the human eye. 

In this section, we consider two sets of input parameters 
based on these criteria but make no additional effort to fine- 
tune and optimise these parameters to perform this mor- 
phological classification. The first set of parameters listed in 
Table[T]have been used extensively in the l iterature for mor - 
phological classification in the SDSS e.g. (|Ball et al.ll2004 ). 
They include the (g-r) and (r-i) colours derived from the 
dereddened model magnitudes although these have not been 
k-corrected to the rest-frame. We do not use the k-corrected 
colours as this would require a redshift to be measured for 
the object and therefore reduce the total number of objects 
in our sample. Furthermore, the vast majority of objects in 
future large-scale photometric surveys will not have secure 
spectroscopic redshifts and we aim to assess how effective 
machine-learning is as a tool for morphological classifica- 
tion in such surveys. The other parameters considered are 
the axis ratios and log likelihoods associated with both a 
de Vaucouleurs and exponential fit to the two-dimensional 
galaxy image. The de Vaucouleurs profile is commonly used 
to describe the variation in surface brightness of an ellipti- 
cal galaxy as a function of radius whereas the exponential 
profile is used to describe the disk component of a spiral 
galaxy. In addition, the log likelihood of the object being 
well fit by a PSF, InLstar, helps in distinguishing extended 
galaxies from more point-like sources. Note that the sample 
only contains those objects that are well-fit by a PSF that 



also have spectra that are best-fit by a galaxy template. 
Stars have already been removed from the sample. The (g- 
r) and (r-i) colours have been chosen as the images used in 
the Galaxy Zoo classifications were composites of images in 
these three bands. All other parameters correspond to the 
i-band images only. 

The distribution of the (g — r) colour, de Vaucouleurs fit 
axis ratio and log likelihood fit to a de Vaucouleurs profile for 
the different morphological classes, is illustrated in Figure [5] 
where we plot the fraction of gold early types, spirals and 
point sources/artifacts as a function of these parameters. 
These histograms are constructed using only objects with a 
fraction of vote greater than 0.8 in each of the three classes 
from Galaxy Zoo i.e. the gold sample. This threshold is ar- 
bitrary and certainly results in the loss of many true early 
types and spirals from our sample. However, choosing a high 
threshold also ensures that the samples we use to construct 
histograms don't suffer much from contamination. Through- 
out the rest of this paper, the Galaxy Zoo early types, spirals 
and point source/artifacts always refer to the gold sample 
with a fraction of vote greater than 0.8. Note that the frac- 
tion of vote is different from the classification probability 
although the two are highly correlated. Many objects with 
fractional votes less than 0.8 are also well classified early 
types or spirals but in this paper we only choose to consider 
those objects that are the most cleanly classified by mem- 
bers of the public for checking against the corresponding 
neural network classifications. 

We can immediately see from Figure [2] that different 
parameters allow us to distinguish between different mor- 
phological classes. As expected, early types are found to be 
redder than spirals whereas the point sources and artifacts 
have a wide range of colours. The axis ratio obtained from 
a de Vaucouleurs fit to the galaxy images is closer to unity 
for early type systems (typically ~0.8) compared to spirals 
(typically ~0.3) and has a bimodal distribution for the point 
sources and artifacts. The log likelihood associated with the 
de Vaucouleurs fit is also larger for the early types than the 
spirals and largest for the point sources and artifacts. 

The second set of input parameters described in Ta- 
ble do not use the colours of galaxies or any parameters 
associated with profile fitting for morphological classifica- 
tion. Instead a new set of shape and texture parameters are 
used as well as the concentration. The concentration is given 
by the ratios of radii containing 90% and 50% of the Pet- 
rosian flux in a given band. mRrCc is the second moment of 
the object intensity in the CCD row and column directions 
measured usin g a scheme designed to h ave optimal signal- 
to-noise ratio (IBernstein fc Jarvidl2002h . 
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Figure 2. Histograms of some of the input parameters in Tables ^ and [2] for the gold sample of early types, spirals and point 
sources / artifacts . 
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mRrCc =< y 2 > + < x 2 > 



(2) 



where x and y are image coordinates relative to the centre 
of the object in question and, for example, 



<y >- 



Y,I{y,x)w{y,x)y 2 
Y J I{y,x)w{y,x) 



(3) 



Moments are measured using a radial gaussian weight 
function, w(y,x) iteratively adapted to the shape and size of 
the object. The ellipticity components, mEl and mE2 defined 
in Eq[4]and Eq[5]and a fourth order moment (Eq[6]l are also 
specified. 



mEl 



<y 2 > 



< x " > 



mRrCc 



mE2 = 2 



< xy > 
mRrCc 



mCr4 



<(y 2 + x 2 f> 



(4) 



(• r >) 



(6) 



where a is the size of the gaussian weight. We use the el- 
lipticity components to define the adaptive ellipticity, aE for 
input into the neural network where: 



aE = 1 - 



1 - VmEl 2 + mE2 2 
1 + VmEl 2 + mE2 2 



(7) 



The final parameter in T able [2] is the text u re or coarse- 
ness parameter described in lYamauchi etafl (|2005l ). This 
essentially measures the ratio of the range of fluctuations 
in the surface brightness of the object to the full dynamic 
range of the surface brightness and is expected to vanish 
for a smooth profile but become non-zero if structures like 
spiral arms appear. 

The distribution of the concentration, adaptive elliptic- 
ity and texture parameters constructed using only the gold 
objects in the Galaxy Zoo catalogue with a fraction of vote 
greater than 0.8, are shown in Figure [2] The concentration 
parameter is larger for early types compar ed to spirals. This 
is con sisten t with the previous s tudies by IShimasaku et al.l 
|200ll ) and IStrateva et all (|200lh who find the inverse con- 
centration index to be larger for spirals than for ellipticals. 
The adaptive ellipticity is large for the spirals, small for the 
early types and slightly smaller still for the point sources 
and artifacts. The texture parameter although roughly sim- 
ilar for the three morphological classes, is still slightly larger 
for spirals as compared to the early types and point sources 
suggesting that the latter have the smoothest surface bright- 
ness profiles. 

We summarise the results of running the neural network 
with these different choices of input parameters in the next 
section. 



5 RESULTS 

The neural network code is run using three different sets of 
input parameters - (i) colours and profile-fitting parameters 



Name 


Description 


petroR90_i/petroR50_i 

mRrCc_i 

aE_i 

mCr4_i 

texture_i 


Concentration 
Adaptive (+) shape measure 
Adaptive Ellipticity 
Adaptive fourth moment 
Texture parameter 



Table 2 . Second Set of Input Parameters based on adaptive mo- 
ments 



from Table [T] (ii) concentration, adaptive shape parameters 
and texture from Table [5] and (iii) a combined set of twelve 
parameters. We also define three samples on which the neu- 
ral network is run. The first sample is the entire catalogue of 
~800,000 objects out of which 50,000 are used for training 
and 25,000 for validation. This is found to be a sufficiently 
large training set for these purposes and no significant im- 
provement in the classifications is seen on increasing the size 
of the training set further. The second sample defined as the 
gold sample contains only objects with a fraction of vote 
greater than 0.8 assigned to them in any one of the three 
morphological classes by Galaxy Zoo users. This is essen - 
tially the same as the clean sample in lLintott et al.l (|2008h . 
This gold sample contains ~315,000 objects and once again 
50,000 are used for training and 25,000 for validation. In 
this gold sample, ~65% are early types, ~30% are spirals 
and ~5% are point sources and artifacts. These fractions 
should not however be interpreted as the true ratio of dif- 
ferent morphological types in the Universe as the fraction of 
early types is much higher in a flux-limited sample such as 
this. In addition, the choice of an arbitrary fractional vote 
threshold of 0.8 for morphological classifications in Galaxy 
Zoo, means many true spirals and ellipticals do not make 
our cut. We also do no t attempt to remove any classifica- 
tion bias as was done in lBamford et all (|2009h as this is not 
particularly important for the aims of this paper. Finally, 
we consider a bright sample with r < 17 only. In this bright 
sample, ~55% are early types, ~40% are spirals and ~5% 
are point sources and artifacts and we can therefore see that 
the bias towards early types is somewhat reduced on remov- 
ing the faint galaxies from the sample. 



5.1 The Entire Sample 

In this section we summarise the results of running the neu- 
ral network on the entire sample of objects using the three 
different sets of input parameters. In Figures 2] and [SI we 
plot the NN probability of the object belonging to a mor- 
phological class versus the percentage of genuine objects in 
that class that are discarded on applying this probability 
threshold as well as percentage of contaminants that en- 
ter the sample. This allows us to determine the optimum 
probability threshold for the neural network that should be 
chosen for membership into each morphological class. This 
threshold is such that the number of contaminants equals 
the number of genuine objects in that class that are dis- 
carded. This is found to be 0.71 for early types, 0.50 for 
spirals, 0.24 for point sources/artifacts when using the pa- 
rameters in Table [T] 0.68 for early types, 0.50 for spirals, 
0.10 for point sources/artifacts when using the parameters 
in Table [2] and 0.73 for early types, 0.58 for spirals and 
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0.26 for point source/artifact when using the combined set 
of parameters. 

For the Galaxy Zoo classifications, we consider the frac- 
tional vote threshold to be 0.8 for the galaxy belonging to a 
particular morphological class. In Tables [3][4] and [5l we sum- 
marise the results of running the neural net on the entire 
sample using the three different sets of input parameters. 
The tables give the percentage of early types, spirals and 
point sources/artifacts in Galaxy Zoo that are put into the 
different classes by the neural network after assuming the 
optimum NN probabilities already mentioned in each of the 
three classes. Throughout this paper, the lowercase names 
- Early Type, Spiral, Point Source/Artifact - correspond to 
the Galaxy Zoo classifications and the upper case names - 
EARLY TYPE, SPIRAL, POINT SOURCE/ARTIFACT - 
correspond to the neural network classifications. Note that 
as we are attempting to minimise both the number of con- 
taminants as well as the number of genuine objects discarded 
in each of the three classes, the total number of objects is 
not necessarily conserved. There may also be a significant 
number of poorly classified objects with a spread of likeli- 
hoods over two or more morphological classes. These poorly 
classified objects will have a probability of less than the 
chosen optimum threshold probability in all three morpho- 
logical classes. Some of these objects that are poorly classi- 
fied by the neural network may have been well classified by 
Galaxy Zoo which is why the sum of the columns in Table 
[3] are typically less than 100%. Some objects may also be 
put into more than one class due to the assumed neural net- 
work threshold probabilities. For example, an object with 
a neural net probability of greater than 0.1 in the POINT 
SOURCE/ ARTIFACT class, could potentially, also have a 
neural net probability of greater than 0.5 in the SPIRAL 
class and therefore be classified as both. This is why the 
sum of the columns in Table [4] are typically greater than 
100%. In the tables presented throughout this analysis, we 
are not summarising the results of unique classifications for 
every single object but rather the best results that could 
be obtained for each subsample assuming neural network 
threshold probabilities that result in the same number of 
contaminants into the sample as genuine objects that are 
discarded. If we were to consider the best results for the 
entire sample as a whole, the chosen probability thresholds 
would probably be different. 

Using the traditional colour and profile-fitting param- 
eters, 87% of early types, 86% of spirals and 95% of point 
sources/artifacts are correctly classified by the neural net- 
work. We note however that while colours are sensitive to 
the star-formation history of a galaxy, the morphology es- 
sentially measures the dynamic history. Although the two 
are correlated, they are not necessarily the same and there- 
fore it is important to use colours in conjunction with other 
parameters when performing morphological classifications. 
Looking more closely at the Galaxy Zoo early types that 
were misclassified by the neural net as spirals as well as the 
spirals that are misclassified by the neural net as early types, 
there is some evidence that these galaxies may be red spirals 
or blue ellipticals. 45 Galaxy Zoo early types are classified as 
spirals with a probability of greater than 0.8 by the neural 
net. Similarly, 40 Galaxy Zoo spirals are classified as early 
types with a probability greater than 0.8. Out of these, 21 
early types and 9 spirals have SDSS spectra available. Using 



the criterion in lBaldrv et al.l (|2004h to isolate red and blue 
galaxies, we find that 6 out of the 9 spirals are red and 10 
out of the 21 early types are blue. In other words, the colour 
information used as an input to the neural network may be 
biasing the morphological classification. However, due to the 
small numbers of misclassified galaxies with SDSS spectra 
available, a definite statement cannot be made and we leave 
this to future work. 

The adaptive shape parameters are very good for distin- 
guishing between spirals and early types and these parame- 
ters result in accurate classifications for 84% of early types 
and 87% of spirals. However, this set of parameters gives 
very poor results for point source/artifact and only 28% are 
correctly classified by the neural network. This is because, 
these parameters are very similar for point sources/artifacts 
and early types as can be seen in some of the histograms in 
Figure [2] The spirals on the other hand have very different 
adaptive shape parameters. As there are many more early 
types in the training set compared to point sources/artifacts, 
the neural network cannot differentiate between the two and 
assigns most of the point sources/artifacts to be early types. 
Also Figure [4] shows that the optimum neural network prob- 
ability for point source/artifact classification using this set 
of input parameters is as low as 0.1. This means that many 
objects will be classified both as a point source and a galaxy 
assuming this probability and for this reason, the sums of 
the columns in Table [3] are always greater than 100% as 
there are objects in common between the classes. We also 
investigate whether the evidence for a bias due to colour in 
the morphological classifications is removed once the colours 
are removed as input parameters to the network. With the 
adaptive shape parameters as inputs we find 208 Galaxy Zoo 
early types to be misclassified as spirals with a probability 
greater than 0.8 by the neural network and similarly, 26 
spirals are misclassified as early types with the same proba- 
bility. Out of these, 116 early types and only 8 spirals have 
SDSS spectra. 2 out of the 8 spirals are red but 46 of the 
116 early types are blue once again suggesting that there 
may still remain a bias, especially against blue ellipticals 
even when the colour parameters are removed as inputs to 
the neural net. This may suggest that atleast some blue el- 
lipticals have more structure than their red counterparts. 
However, we emphasise that the sample sizes here are too 
small to make any quantitative statement about the extent 
of this bias and we defer this to future work. 

On adding the profile-fitting and colour parameters to 
the adaptive shape parameters, the results are now consid- 
erably improved for all three classes. 92% of early types, 
92% of spirals and 96% of point sour ces/artifacts a r e cor - 
rectly classified by the neural network. iLintott et al.l (|2008l ) 
have compared the Galaxy Zoo classifications to those by 
professional astronomers and find an agreement of bet- 
ter than 97 % between these sam ples. However, the sam- 
pl es used in ILintott et al.1 (|2008h are the MOSES sample 
of ISchawinski et al.l ( 20071 ) whose objective was to generate 
a very clean set of elliptical galaxi es, and the detailed clas- 
sifications of lFukugita et akl (|2007i ) which are very sensitive 
to the E/Sa boundary. If the professional astronomers were 
set the same task as the Galaxy Zoo users - i.e. a clean 
division of spirals and ellipticals in SDSS - the scatter be- 
tween them and the Galaxy Zoo users may well be worse 
than this. We have shown that with a set of twelve intelli- 
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Figure 3. The neural network probability of a galaxy being an early type (top left), spiral (top right) and point source/artifact (bottom) 
versus the percentage of contaminants as well as the percentage of Galaxy Zoo objects in these classes that are discarded. These results 
are obtained using the 7 input parameters in Table [TJ based on colours and traditional profile- fitting. 
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Table 3. Summary of results for entire sample when using input parameters specified in TablcfTI- colours and traditional profile-fitting. 



gently chosen parameters, that are easily available but still 
by no means optimal for performing morphological classifi- 
cation, the neural network results agree to better than 90% 
with those from Galaxy Zoo. It can therefore certainly be 
expected that by using a set of better tuned input parame- 
ters to the network, the machine learning algorithm will be 
able to classify galaxies with comparable or less scatter than 
that produced by human classifications. 

5.2 The Gold Sample 

We now describe the results of running the neural network 
on our gold sample using the three different sets of input pa- 
rameters. These results are summarised in Tables [6l[7| and [8] 



where we consider the percentage of Galaxy Zoo early types, 
spirals and point sources/artifacts that have also been put 
into these classes by the neural network assuming a proba- 
bility of greater than 0.8 now for the neural network classifi- 
cation. Once again it can be seen that the second set of input 
parameters involving just the concentration, texture and the 
adaptive shape parameters, performs very poorly in classify- 
ing the point sources and artifacts. However, the combined 
set of input parameters leads to correct neural network clas- 
sifications for 97% of early types and 97% of spirals, on par 
with the agreement between Galaxy Zoo and professional 
astronomers. The success for point sources/artifacts is some- 
what lowered compared to using the entire sample. This is 
because when considering the entire sample we assume that 



Galaxy Zoo: Morphology via Machine Learning 9 




0.2 0.4 0.6 0.8 1 

NN Probability of Being Point Source 



Figure 4. The neural network probability of a galaxy being an early type (top left), spiral (top right) and point source/artifact (bottom) 
versus the percentage of contaminants as well as the percentage of Galaxy Zoo objects in these classes that are discarded. These results 
are obtained using the 5 input parameters in Table [2] that use adaptive moments, concentration and texture. 
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Table 4. Summary of results for entire sample when using input parameters specified in Tablc[2]- adaptive moments. 



all objects with a neural network point source/artifact prob- 
ability of greater than ~0.2 do indeed belong to the point 
sources/artifacts class whereas in this section we require the 
probability to be greater than 0.8. 



5.3 The Bright Sample 

In this section, we run the neural network code using the 
combined set of twelve input parameters on our bright sam- 
ple with r < 17. This allows us to perform two tests. Firstly, 
we look at whether the bright galaxies in general have better 
classifications compared to the entire sample. This is done 
by comparing Tables [5] and [9] Secondly, we train our neu- 
ral network on the bright sample and use this trained net- 



work to perform morphological classifications for the entire 
sample. This allows us to quantify the effects of magnitude 
incompleteness in the training set on the morphological clas- 
sifications. The results are summarised in Tables [9l and [lOl 
Once again the probability threshold required in the neural 
net output for classification is determined by requiring the 
percentage of contaminants to be equal to the percentage of 
genuine objects in that class that are discarded on applying 
the threshold. This optimum probability threshold is very 
similar for the bright sample to those shown in Figure [5] but 
slightly higher in all three classes when the bright galaxies 
are used for training and used to classify all galaxies. 

Comparing Tables [5] and [9] we can see that the results 
are slightly better when performing morphological classsifi- 
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Figure 5. The neural network probability of a galaxy being an early type (top), spiral (middle) and point source/artifact (bottom) 
versus the percentage of contaminants as well as the percentage of Galaxy Zoo objects in these classes that are discarded. These results 
are obtained using the combined set of 12 input parameters from Tables [TJ and [2] 
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Table 5. Summary of results for entire sample when using input parameters specified in Table [TJ and Table [2] 



cations on the bright sample with r < 17 compared to the 
entire sample with r < 17.77, in both cases using a complete 
training set. This is to be expected as it is easier to distin- 
guish between early types and spirals in a brighter sample. 
When the training is performed using an incomplete training 
set with r < 17 and all objects with r < 17.77 classified, the 
neural network still manages to perform these classifications 
with more than 90% agreement with Galaxy Zoo users. The 
magnitude incompleteness in the training set doesn't seem 
to affect the classifications as most of the input parameters 
to the neural network considered in this study have been 
chosen to be distance independent and so their distribution 
doesn't really change from a shallow to a deeper sample. 
This is promising for using automated machine learning al- 



gorithms to perform morphological classsification for future 
deep surveys using the Galaxy Zoo classifications on the 
shallower SDSS survey as a training set. 



6 CONCLUSIONS 

In this study, we have used a machine-learning algorithm 
based on artificial neural networks to perform morphologi- 
cal classifications for almost 1 million objects from the Sloan 
Digital Sky Survey that were classified by eye as part of the 
Galaxy Zoo project. The neural network is trained on 75000 
objects and using a well defined set of input parameters that 
are also distance independent, we are able to reproduce the 
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Table 6. Summary of results for gold sample when using input parameters specified in Table[T]- colours and traditional profile-fitting. 
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Table 7. Summary of results for gold sample when using input parameters specified in Table [2]- adaptive moments. 



human classifications for the rest of the objects to better 
than 90% in three morphological classes - early types, spi- 
rals and point sources/artifacts. The Galaxy Zoo catalogue 
provides us with a training set of unprecedented size for au- 
tomated morphological classifications via machine learning. 
Specifically, we draw the following conclusions: 

• Using colours and profile-fitting parameters as inputs to 
the neural network, 87% of early type classifications, 86% of 
spiral classifications and 95% of point source/artifact classi- 
fications agree with those obtained by the human eye. How- 
ever, there is some evidence to suggest that a non-negligible 
fraction of red spirals and blue ellipticals are misclassified 
by the network when using this set of parameters. 

• When parameters that rely on an adaptive weighted 
scheme for fitting the galaxy images are used, the neural 
network is unable to distinguish between early types and 
point source/artifact as these parameters are very similar 
for the two classes. 

• A combination of the profile fitting and adaptive 
weighted fitting parameters results in better than 90% agree- 
ment between classifications by humans and those by the 
neural network for all three morphological classes. This is 
approaching the success of Galaxy Zoo users in reproduc- 
ing the classifications by professional astronomers and will 
certainly surpass this with a more finely tuned set of input 
parameters than we have considered in this paper. 

• The optimum neural network probability for a galaxy 
belonging to a particular morphological class is such that the 
percentage of contaminants is equal to the percentage of gen- 
uine objects in that class that are discarded on cutting the 
sample using this threshold. This optimum probability de- 
pends both on the input parameters as well as the morpho- 
logical class of the object. We find that early types generally 
have a high optimum probability (~ 0.7) whereas the point 
sources and artifacts have a very low optimum probability 
(~0.2). Therefore the same object could be put into more 
than one class by the neural network if the classifications 
were performed using the optimum threshold probabilities. 

• For our gold sample, the early type and spiral classi- 
fications by the neural network match those by the human 
eye to better than 95%. 

• Using a bright sample to train the neural network and 
performing morphological classifications for a deeper and 



fainter sample still results in better than 90% agreement 
between the neural network and human classifications in all 
three morphological classes. This is because we have delib- 
erately chosen our input parameters to be distance indepen- 
dent. 

• However, other sources of incompleteness in the training 
sets also need to be examined before the role of the Galaxy 
Zoo data in training morphological classifiers for future sur- 
veys, can be fully understood. 

The penultimate point in particular illustrates the 
power of the machine learning algorithm in fully exploiting 
data from future wide-field imaging surveys such as the Dark 
Energy Survey, Pan-STARRS, HyperSuprime-Cam, LSST, 
Euclid etc. Such surveys will obtain images for hundreds of 
millions of objects down to very deep magnitude limits. We 
have shown that by using the wealth of information made 
available through the Galaxy Zoo project as a training set, 
the machine learning algorithm can quickly and accurately 
classify the vast numbers of objects that will make up future 
data sets into early types, spirals and point sources/artifacts. 
However, if the Galaxy Zoo catalogue is to be used as a 
training set for automated machine learning classifications 
of mergers with the next generation of galaxy surveys, a 
more r obust catalogue of visually classified mergers such as 
that o f lDarg et al. I (|2009T I needs to be used. Also, it is worth 
emphasising that the images obtained from the next gener- 
ation of wide-field surveys will need to have the necessary 
pixel size and resolution required to derive photometric pa- 
rameters such as those used as inputs to the neural network 
in this paper. 

This paper has also examined the effect of magnitude 
incompleteness in the training set on automated morpho- 
logical classifications. In the future, more work needs to 
be done on investigating other sources of incompleteness in 
the training set before this data can be used effectively to 
train machine-learning morphological classifiers for future 
surveys. For example, in this paper we have found some ev- 
idence that a non-negligible proportion of galaxies that are 
misclassified by the neural network are either red spirals or 
blue ellipticals. This misclassification is almost certainly due 
to the sparsity of such objects in the training set and needs 
to be addressed in future machine learning papers that use 
this data. We have also only used objects with a fraction 
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Table 8. Summary of results for gold sample when using input parameters specified in Table[T]and Table |21 
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Table 9. Summary of results for bright sample when using input parameters specified in Table [T] and Table [2] 



of vote greater than 0.8 from Galaxy Zoo to compare to 
our neural network classifications. In future work, we hope 
to address how excluding "intermediate" objects from our 
comparison is likely to bias our results. Nevertheless, we con- 
clude that morphological classification via machine-learning 
looks promising in allowing for many more detailed studies 
of the processes involved in galaxy formation and evolution 
with the next generation of galaxy surveys. 
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Table 10. Summary of results for entire sample when using input parameters specified in Table[T]and Tableland only bright galaxies 
with r < 17 to train the network. 
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